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"Express Mail" mailing label number EV 327682986US 

Date of Deposit: November 19, 2003 

Attorney Docket No. 15145US02 

REDUCTION OF MEMORY REQUIREMENTS BY DE- INTERLEAVING AUDIO 

SAMPLES WITH TWO BUFFERS 

CROSS-REFERENCE TO RELATED APPLICATIONS /INCORPORATION BY 

REFERENCE 

[0001] The present application claims the benefit of 
U.S. Provisional Application No. 60/516,195 entitled 
REDUCTION OF MEMORY REQUIREMENTS BY DE- INTERLEAVING AUDIO 
SAMPLES WITH TWO BUFFERS filed on October 31, 2003, the 
complete subject matter of which is hereby incorporated 
herein by reference in its entirety. 

FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
[0002] [Not Applicable] 

MICROFICHE /COPYRIGHT REFERENCE 
[0003] [Not Applicable] 

BACKGROUND OF THE INVENTION 
[0004] Pursuant to the MPEG-2 Advanced Audio Coding 
{MPEG-2 AAC) standard, audio signals are sampled at 48K 
samples /second. The samples are grouped into consecutive 
frames of 1024 samples. Windowing is applied on a bloclc 
of audio samples. The block length ''N" could be either 
2048 or 256 samples. However, each window has a 50% 
overlap with the previous window. 
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[0005] Accordingly, the first N/2 samples of a window are 
the same as the last N/2 samples of the previous window. 
A window function is applied to each window, resulting in 
sets of 2048 or 256 windowed samples. A modified 
discrete cosine transformation is applied to each set of 
windowed samples, resulting in N/2 frequency 
coefficients. The frequency coefficients are then 

quantized and coded for transmission. 

[0006] The first step in decoding is to establish the 
frame synchronization. Once the frame synchronization is 
found, the AAC bitstream can be decoded to generate audio 
time domain samples. The decoding process includes 
Huffman decoding, scale factor decoding, and decoding of 
side information used in tools such as Mid/Side (M/S) , 
intensity stereo, TNS, and filter bank. The spectral 
samples are decoded and copied to the output buffer in a 
sampled fashion. 

[0007] After Huffman decoding, for example, each 
coefficient may be inverse quantized by a 4/3 power 
nonlinearity and then scaled by the quantizer step size. 
Finally, the Inverse MDCT (IMDCT) transforms the spectral 
coefficients into time domain. After the IMDCT 

transform, the output samples may be windowed, 
overlapped, and added for generating the final pulse code 
modulate (PCM) samples. 

[0008] Further limitations and disadvantages of 
conventional and traditional systems will become apparent 
to one of skill in the art through comparison of such 
systems with the invention as set forth in the remainder 
of the present application and with reference to the 
drawings appended hereto . 
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BRIEF SUMMARY OF THE INVENTION 
[0009] Aspects of the present invention may be found in a 
method of reducing memory requirements by de-interleaving 
audio information with static and dynamic buffers. The 
method may comprise writing interleaved first audio 
channel information to a first static memory device, de- 
interleaving the first audio channel information, and 
writing de- interleaved first audio channel information to 
a second static memory device - 

[0010] The method may also comprise writing de- 
interleaved first audio channel information to a dynamic 
memory device from the second static memory device, 
overwriting interleaved first audio channel information 
with interleaved second audio channel information in the 
first static memory device, de- interleaving second audio 
channel information, and writing de-interleaved second 
audio channel information to a second static memory 
device . 

[0011] The method may also comprise overwriting 
interleaved second audio channel information in the first 
static memory with de- interleaved first audio channel 
information from the dynamic memory device and decoding 
the first and second audio channel information . 

[0012] In another embodiment of the present invention, 
the first audio channel information and the second audio 
channel information may comprise interleaving, wherein 
interleaving may comprise associating similar audio 
information from adjacent frames. 
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[0013] In another embodiment of the present invention, 
similar audio information may comprise audio information 
corresponding to a particular range of audio frequencies. 

[0014] Another embodiment of the present invention may 

comprise maintaining audio quality of transmitted audio 
information by receiving the audio information in an 
interleaved state . 

[0015] Aspects of the present invention may also be found 
in a system for de-interleaving audio information with 
static and dynamic buffers. The system may comprise a 
first static memory device, a de-interleaving engine, a 
second static memory device, a dynamic memory device, and 
an audio decoder. The interleaved first audio channel 
information may be written into the first static memory 
device, de- interleaved by the de-interleaving engine, 
written into the second static memory device, and 
temporarily stored in the dynamic memory device. 

[0016] In another embodiment of the present invention, 
the interleaved first audio channel information may be 
overwritten in the first static memory by interleaved 
second audio channel information. The interleaved second 
audio channel information may be de- interleaved by the 
de- interleaving engine. The de-interleaved second audio 
channel information may be written to the second static 
memory device. 

[0017] In another embodiment of the present invention, 
interleaved second audio channel information in the first 
static memory may be overwritten by the de- interleaved 
first audio channel information from the dynamic memory 
device . 
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[0018] In another embodiment of the present invention, 
the de- interleaved first and second audio channel 
information may be decoded by the audio decoder. 

[0019] In another embodiment of the present invention, 

interleaved audio information may comprise similar audio 
information from adjacent frames being associated in 
ad j acent sub- frames . 

[0020] In another embodiment of the present invention, 
similar audio information may comprise audio information 
corresponding to a particular range of audio frequencies. 

[0021] In another embodiment of the present invention, 
audio quality of transmitted audio information may be 
maintained by receiving the audio information in an 
interleaved state . 

[0022] Aspects of the present invention may also be found 
; in an application specific integrated circuit (ASIC) for 

de-interleaving audio information- The circuit may 

comprise a first static memory device operatively 
connected to a de- interleaving engine, a second static 
memory device also operatively connected to the de- 
interleaving engine, a dynamic memory device operatively 
connected to the first and second static memory devices, 
and an audio decoder operatively connected to the first 
and second static memory devices. The interleaved first 
audio channel information may be written into the first 
static memory device, de-interleaved by the de- 
interleaving engine, written into the second static 
memory device, and temporarily stored in the dynamic 
memory device. 
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[0023] In another embodiment of the present invention, 
the interleaved first audio channel information may be 
overwritten in the first static memory by interleaved 
second audio channel information, the interleaved second 
audio channel information may be de-interleaved by the 
de-interleaving engine, and the de- interleaved second 
audio channel information may be written to the second 
static memory device. 

[0024] In another embodiment of the present invention, 
the interleaved second audio channel information in the 
first static memory may be overwritten by the de- 
interleaved first audio channel information from the 
dynamic memory device. 

[0025] In another embodiment of the present invention, 
the de-interleaved first and second audio channel 
information may be decoded by the audio decoder. 

[0026] In another embodiment of the present invention, 
interleaved audio information may comprise similar audio 
information from adjacent frames being associated in 
ad j acent sub- frames . 

[0027] In another embodiment of the present invention, 
similar audio information may comprise audio information 
corresponding to a particular range of audio frequencies. 

[0028] In another embodiment of the present invention, 
audio quality of transmitted audio information may be 
maintained by receiving the audio information in an 
interleaved state . 

[0029] These and other advantages and novel features of 
the present invention, as well as details of an 
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illustrated embodiment thereof, will be more fully 
understood from the following description and drawings. 
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BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGS 
[0030] Figure 1 is a block diagram describing encoding of 
an audio signal according to an embodiment of the present 
invention; 

[0031] Figure lA is a block diagram describing 
interleaving of an encoded audio signal according to an 
embodiment of the present invention; 

[0032] Figure 2 is a block diagram of an exemplary AAC 
decoder in accordance with an embodiment of the present 
invention; 

[0033] Figure 3 is a block diagram illustrating a memory 
arrangement for reducing memory required to perform an 
audio process in accordance with an embodiment of the 
present invention ; 

[0034] Figure 4 is a block diagram illustrating a memory 
and audio decoder arrangement according to an embodiment 
of the present invention; and 

[0035] Figure 5 is a flow diagram illustrating a method 
for processing a multi-channel audio signal in accordance 
with an embodiment of the present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 
[0036] Referring now to Figure 1, there is illustrated a 
block diagram 100 describing the encoding of an exemplary 
audio signal A(t) 110 according to an embodiment of the 
present invention. Pursuant to the MPEG-2 Advanced Audio 
Coding (MPEG-2 AAC) standard, the audio signal 110 may be 
sampled at 48K samples /second. The samples may be 
grouped into frames Fo...Fn of 1024 samples 120, e.g., 
Fx(0) . . .Fx (1023) - Windowing may be applied on a block of 
audio samples 130. The block length "N" could be either 
2048 or 256 samples. 

[0037] Each window Wx niay have a 5 0% overlap with the 
previous window Wx-i - Accordingly, the first N/2 samples 
of a window Wx may be the same as the last N/2 samples of 
the previous window Wx-i. A window function w(t) may be 
applied to each window Wo . . .Wn/ resulting in sets wWo • - .wWn 
of 2048 windowed samples 140, e.g., wWx ( 0 ) . . . wWx (2047 ) . 
For example, for a short block having 256 samples, 
windowing may be applied for each of the 256 samples, 
resulting in 2 048 samples, after windowing. 

[0038] A modified discrete cosine transformation (MDCT) 
may be applied to each set wWo...wWn of windowed samples 
wWx(O) . . .wWx(2047) , resulting in sets MDCTq - . - MDCTn of 1024 
frequency coefficients 150, e.g. , MDCTx(O) . . .MDCTx(1023) . 

[0039] The sets of frequency coefficients MDCTq - - -MDCTn 
may be quantized and coded for transmission, forming what 
is known as an audio elementary stream (AES) . The AES 
may be packetized and padded with extra headers to form 
an Audio Transport Stream (Audio TS) - Transport streams 
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may be multiplexed together, stored, and/or transported 
for playback on a playback device. 

[0040] The playback device may either be locally or 
remotely located. Where the playback device is remotely 
located, the multiplexed stream may be transported over a 
communication medium, such as the Internet. During 
playback, the multiplexed transport stream may be de- 
multiplexed, resulting in individual Audio Transport 
Streams. The Audio TS may be stripped of the additional 
headers. The constituent AES's may be decoded, resulting 
in the audio signal . 

[0041] Figure lA is a block diagram lOOA describing 
interleaving of an encoded audio signal according to an 
embodiment of the present invention. The frequency 
coefficients MDCTq - . . MDCTn 150A may be interleaved. In an 
audio translatable signal, each frame may comprise 
frequency coefficients MDCTq - - .MDCT1023 which may be 
further divided into a plurality of equal sub- frames 
160A, for example, 8 sub- frames, wherein each sub- frame 
may have 128 frequency coefficients contained therein. 
Sub- frame contents may correspond to a particular range 
of audio frequencies, 

[0042] For example, frameo may comprise 8 sub- frames 
organized as follows: sub-frame Oq may contain frequency 
coefficients 0 through 127, sub-frame Iq may contain 
frequency coefficients 128 through 255, and sub-frame 

7o may contain frequency coefficients 896 through 1023. 

[0043] Likewise, the next frame, framei may also comprise 
8 sub-frames organized as follows: sub-frame Oi may 
contain frequency coefficients 0 through 127, sub-frame li 
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may contain frequency coefficients 128 through 255, 
and sub- frame 7i may contain frequency coefficients 896 
through 1023 . 

[0044] The frames may be divided into sub- frames based 
upon the block type of that particular frame. In the 
case of an AAC algorithm, for a short block type, the 
frame may be divided into 8 sub frames. Each of these 
sub-frames may further comprise sub-bands 170A, Each of 
these sub-bands may comprise a variable number of 
frequency coefficients. For example, sub- frame Oq may 
comprise 14 sub-bands organized as follows: sub-band Ooo 
may contain frequency coefficients 0 through 3, sub-band 
Ooi may contain frequency coefficients 4 through 7, and 
sub-band O013 may contain frequency coefficients 112 
through 127. 

[0045] Interleaving may comprise a re-associating of sub- 
bands containing similar audio translatable frequencies. 
For example, each of the sub-bands of sub- frame Oo may be 
associated adjacent to the corresponding sub-bands of 
sub-frame lo 180. Interleaving the samples decreases the 
effective number of bits required to be transmitted, 
because a single set of scale factors may be transmitted 
for the set of frequency coefficients formed by the 
association of all of the sub-bands in adjacent sub- 
frames. The number of sub-frames that may be associated 
together may also vary in a frame. For example, in 
Figure lA, two sub-frames have been associated (grouped) 
together . 

[0046] Interleaving of audio samples promotes sound 
quality by associating similar audio translatable 
frequencies across sub- frames in a frame. The number of 
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bits required to be transmitted to recreate audio time 
samples may be reduced. 

[0047] Referring now to Figure 2, there is illustrated a 
block diagram describing an exemplary AAC decoder 300 in 
accordance with an embodiment of the present invention. 
Once the frame synchronization is found, the AAC 
bitstream may be de-multiplexed by a bitstream de- 
multiplexer 305. 

[0048] This may include Huffman decoding 310, scale 
factor decoding 315, and decoding of side information 
used in tools, such as, Mid/Side decoding (M/S) 320, 
prediction decoding 323, intensity stereo 325, TNS 330, 
and filter bank 335. The sub- frames containing audio 
translatable frequency coefficients may be decoded and 
copied to an output buffer. 

[0049] After Huffman decoding 310, an inverse quantizer 
340 may inverse quantize each sub-frame of audio 
translatable frequency coefficients by a 4/3 power 
nonlinearity . The scale factors 315 may then be used to 
scale sets of frequency coefficients MDCTq. . .MDCTn by the 
quantizer step size. 

[0050] Additionally, tools including the M/S decoding 
320, intensity stereo 325, TNS 330, and the filter bank 
335 may apply further functions to the sets of frequency 
coefficients MDCTq. . .MDCTn. Finally, the gain control 350 
may transform the frequency coefficients MDCTq ... MDCTn 
into the time domain signal A(t) . The gain control 350 
may transform the frequency coefficients by application 
of the Inverse MDCT (IMDCT) , the inverse window function, 
window overlap, and window adding. The audio signal may 
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be transmitted to a remote device for playback or stored 
for later playback. When playback is desired, the audio 
signal may be de-multiplexed and decoded. 

[0051] Referring now to Figure 3, there is illustrated a 
block diagram 600 describing the de-interleaving process 
using plural memory devices according to an embodiment of 
the present invention. The de- interleaving process may 
be carried out using a pair of SRAM (static random access 
memory devices) memory units 610,620 and a DRAM (dynamic 
random access memory devices) memory unit 630. The audio 
stream may also be configured such that the left and 
right audio channels may be de-interleaved separately. 
The de-interleaving process may be carried out after the 
frequency coefficients MDCTo . . . MDCTn have been scaled, 
using the scale factors, by the quantizer step size. 

[0052] Initially, a portion of an audio stream comprising 
interleaved left channel audio frequency coefficients may 
be written into a first static memory device 610. The 
interleaved left channel audio frequency coefficients may 
then be de-interleaved by a de- interleaving engine 640. 
The de- interleaved left channel audio frequency 
coefficients may be written to a second SRAM memory 
device 620 as they are de-interleaved. 

[0053] When all of the left channel audio frequency 
coefficients have been de-interleaved, the first static 
memory device 610 contains the interleaved audio 
information and the second static memory device 620 
contains the de- interleaved audio information. 

[0054] In order to make room for de- interleaving the 
right channel audio frequency coefficients, the de- 
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interleaved left channel audio frequency coefficients may 
be written to a DRAM dynamic memory device 630 for 
temporary storage. The interleaved left channel audio 
frequency coefficients in the first SRAM 610 are 
overwritten by the interleaved right channel audio 
frequency coefficients. 

[0055] Similar to the de- interleaving of the left channel 
information, the interleaved right channel audio 
frequency coefficients may be de-interleaved by the de- 
interleaving engine. When the interleaved right channel 
audio frequency coefficients have been de-interleaved, 
the first SRAM 610 contains the interleaved right channel 
information, the second SRAM contains the de- interleaved 
right channel information, and the DRAM 630 dynamic 
memory device contains the left channel de- interleaved 
information. 

[0056] The DRAM dynamic memory device 630, while 
appropriate for temporary information storage may not be 
appropriate for further audio decoding processes, whereas 
the static memory devices 610, 620 may be better suited 
to continued audio decoding processes. The de- 

interleaved left channel audio frequency coefficients may 
therefore be sent from the DRAM dynamic memory device 630 
to the first SRAM static memory device 610, i.e., the 
interleaved right channel audio frequency information in 
the first SRAM static memory device 610 is overwritten by 
the de- interleaved left channel audio frequency 
information . 

[0057] Referring now to Figure 4, there is illustrated a 
block diagram 660 describing a memory arrangement for 
decoding de-interleaved audio frequency information. In 
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Figure 4, the first and second static memory devices 610^ 
620 may contain the de-interleaved left and right channel 
audio frequency information, respectively, and the 
dynamic memory device 630 may contain no information. 

[0058] The de-interleaved right and left channel audio 
frequency information pass through the decoding stages 
such as M/S decoding. Intensity coupling decoding, TNS, 
and filter bank, individually, generating the final PCM 
samples for the left and right channels respectively in 
audio decoder 666. 

[0059] Referring now to Figure 5, there is illustrated a 
flow diagram 500 for a method of de-interleaving plural 
audio channel information using a minimum number of 
memory devices. In Figure 5, in block 510, the method 
discloses writing interleaved left channel audio 
information to a first SRAM static memory device. 

[0060] In block 520, the interleaved left channel audio 
information may be de- interleaved by a de- interleaving 
engine and written 530 into a second SRAM static memory 
device. The de-interleaved left channel information may 
be written 540 to a DRAM dynamic memory device for 
temporary storage and the interleaved left channel audio 
information may be overwritten in the first SRAM static 
memory device by the interleaved right channel audio 
information 550 . 

[0061] The interleaved right channel audio information 
may be de-interleaved by a de-interleaving engine 560 and 
the de- interleaved right channel audio information may be 
written to the second SRAM static memory device 570. The 
interleaved right channel audio information in the first 
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static memory device may then be overwritten by the de- 
interleaved left channel audio information from the DKM/L 
dynamic memory device 580. The decoding process 

continues while the audio transmission continues 590. 

[0062] Thus, the de-interleaved left channel audio 
information may be contained in the first SRAM static 
memory device and the de-interleaved right channel audio 
information may be contained in the second SRAM static 
memory device. The de-interleaved left and right channel 
audio information may be further decoded into left and 
right channel PCM data by passing through additional 
decoding stages, such as, M/S decoding. Intensity 
coupling decoding, TNS, and filter bank, individually. 

[0063] The methods and systems described herein may be 
implemented as a board level product, as a single chip, 
application specific integrated circuit (ASIC) , or within 
varying levels of the decoder system and/or integrated 
with other portions of the system as separate components. 

[0064] The degree of integration of the decoder system 
will primarily be determined by speed and cost 
considerations. Because of the sophisticated nature of 
modern processors, it is possible to utilize a 
commercially available processor, which may be 
implemented external to an ASIC implementation. 

[0065] Alternatively, if the processor is available as an 
ASIC core or logic block, then a commercially available 
processor can be implemented as part of an ASIC device 
wherein certain operations are implemented in firmware. 
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[0066] While the invention has been described with 
reference to certain embodiments, it will be understood 
by those skilled in the art that various changes may be 
made and equivalents may be substituted without departing 
from the scope of the invention. 

[0067] In addition, many modifications may be made to 
adapt a particular situation or material to the teachings 
of the invention without departing from its scope. 
Therefore, it is intended that the invention not be 
limited to the particular embodiment ( s ) disclosed, but 
that the invention will include all embodiments falling 
within the scope of the appended claims. 



